AITopics | Preah Sihanouk Province

Collaborating Authors

Preah Sihanouk Province

'100 Video Calls Per Day': Models Are Applying to Be the Face of AI Scams

WIREDMar-16-2026, 09:00:00 GMT

'100 Video Calls Per Day': Models Are Applying to Be the Face of AI Scams Dozens of Telegram channels reviewed by WIRED include job listings for "AI face models." The (mostly) women who land these gigs are likely being used to dupe victims out of their money. "I can speak fluent English, I can speak good Chinese, I also speak Russian and Turkish," the glamorous, 24-year-old Uzbekistani woman explains in a selfie-style video made for recruiters. Angel had arrived in the Cambodian city of Sihanoukville that day, she said, and was ready to start work immediately. Those impressive language skills, however, have likely been put to use as part of elaborate " pig-butchering " scams targeting Americans.

artificial intelligence, social media, spam filtering, (12 more...)

WIRED

Country:

Asia > Cambodia > Preah Sihanouk Province > Sihanoukville (0.24)
Asia > Middle East > Iran (0.16)
North America > United States > California (0.14)
(13 more...)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)
Law > Criminal Law (0.69)
Government > Regional Government (0.69)

Technology:

Information Technology > Security & Privacy > Spam Filtering (0.70)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.60)
Information Technology > Communications > Social Media (0.52)

Add feedback

Hierarchical Memory Organization for Wikipedia Generation

Yu, Eugene J., Zhu, Dawei, Song, Yifan, Wong, Xiangyu, Zhang, Jiebin, Shi, Wenxuan, Li, Xiaoguang, Liu, Qun, Li, Sujian

arXiv.org Artificial IntelligenceJul-1-2025

Generating Wikipedia articles autonomously is a challenging task requiring the integration of accurate, comprehensive, and well-structured information from diverse sources. This paper introduces the Memory Organization-based Generation (MOG) framework, a novel approach to address these challenges by leveraging a hierarchical memory architecture. MOG extracts fine-grained memory units from web documents, recursively organizes them into a Wikipedia-style hierarchical structure, and uses this structure to guide the generation process. This ensures alignment between memory and the article outline, improving both informativeness and verifiability while minimizing hallucinations. Additionally, a citation module is implemented to enhance traceability by linking every generated sentence to specific memory units. Evaluations on our newly created WikiStart dataset demonstrate that MOG outperforms baseline methods in producing informative and reliable articles, making it particularly robust in real-world scenarios.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2506.23393

Country:

Asia > Philippines (0.15)
Asia > Malaysia (0.14)
Asia > Singapore (0.06)
(16 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Sports > Olympic Games (0.68)
Consumer Products & Services > Travel (0.68)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)

Add feedback

Khmer Semantic Search Engine (KSE): Digital Information Access and Document Retrieval

Thuon, Nimol

arXiv.org Artificial IntelligenceJun-16-2024

The search engine process is crucial for document content retrieval. For Khmer documents, an effective tool is needed to extract essential keywords and facilitate accurate searches. Despite the daily generation of significant Khmer content, Cambodians struggle to find necessary documents due to the lack of an effective semantic searching tool. Even Google does not deliver high accuracy for Khmer content. Semantic search engines improve search results by employing advanced algorithms to understand various content types. With the rise in Khmer digital content such as reports, articles, and social media feedback enhanced search capabilities are essential. This research proposes the first Khmer Semantic Search Engine (KSE), designed to enhance traditional Khmer search methods. Utilizing semantic matching techniques and formally annotated semantic content, our tool extracts meaningful keywords from user queries, performs precise matching, and provides the best matching offline documents and online URLs. We propose three semantic search frameworks: semantic search based on a keyword dictionary, semantic search based on ontology, and semantic search based on ranking. Additionally, we developed tools for data preparation, including document addition and manual keyword extraction. To evaluate performance, we created a ground truth dataset and addressed issues related to searching and semantic search. Our findings demonstrate that understanding search term semantics can lead to significantly more accurate results.

keyword, search engine, search result, (8 more...)

arXiv.org Artificial Intelligence

2406.0932

Country:

Asia > Cambodia > Phnom Penh Province > Phnom Penh (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Belgium (0.04)
(7 more...)

Genre: Research Report > New Finding (1.00)

Industry: Consumer Products & Services > Travel (0.30)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)

Add feedback

EDIS: Entity-Driven Image Search over Multimodal Web Content

Liu, Siqi, Feng, Weixi, Fu, Tsu-jui, Chen, Wenhu, Wang, William Yang

arXiv.org Artificial IntelligenceOct-23-2023

Making image retrieval methods practical for real-world search applications requires significant progress in dataset scales, entity comprehension, and multimodal information fusion. In this work, we introduce \textbf{E}ntity-\textbf{D}riven \textbf{I}mage \textbf{S}earch (EDIS), a challenging dataset for cross-modal image search in the news domain. EDIS consists of 1 million web images from actual search engine results and curated datasets, with each image paired with a textual description. Unlike datasets that assume a small set of single-modality candidates, EDIS reflects real-world web image search scenarios by including a million multimodal image-text pairs as candidates. EDIS encourages the development of retrieval models that simultaneously address cross-modal information fusion and matching. To achieve accurate ranking results, a model must: 1) understand named entities and events from text queries, 2) ground entities onto images or text descriptions, and 3) effectively fuse textual and visual representations. Our experimental results show that EDIS challenges state-of-the-art methods with dense entities and a large-scale candidate set. The ablation study also proves that fusing textual features with visual features is critical in improving retrieval results.

information retrieval, machine learning, pattern recognition, (19 more...)

arXiv.org Artificial Intelligence

2305.13631

Country:

Europe > United Kingdom (0.14)
Asia > South Korea (0.14)
Asia > Cambodia > Preah Sihanouk Province > Sihanoukville (0.04)
(11 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance (0.93)
(2 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
(2 more...)

Add feedback